The SYSTERS Protein Family Web Server: Shortcut from large-scale sequence information to phylogenetic information SYSTERS superfamily 114462 comprises most of the Cation efflux domain proteins in Arabidopsis thaliana

نویسندگان

  • Thomas Meinel
  • Eike Staub
  • Antje Krause
  • Hannes Luz
  • Stefanie Hartmann
  • Ute Krämer
  • Joachim Selbig
  • Martin Vingron
چکیده

With this poster [11], we present the SYSTERS protein family database, an attempt to classify all available protein sequences. In particular, we focus on the capability of the web interface to assist in in-depth analyses of special protein families. We demonstrate this by an analysis of a specific family of transmembraneous metal ion transport proteins characterised by the so called cation efflux domain. We show three strategies to query SYSTERS: 1. by protein domain name as defined in Pfam [1], 2. by a set of sequence database accession numbers, and 3. by SYSTERS superfamily ID. All strategies lead to the identification of relevant SYSTERS protein families with description annotations from the source database entries, links to sequences, annotated alignments, family consensus sequences, phylogenetic trees, phylogenetic profiles, functional annotations, and links to other sequence resources. The analysis presented here documents how SYSTERS can help to generate new hypotheses about the function and evolution of protein families.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The SYSTERS Protein Family Database in 2005

The SYSTERS project aims to provide a meaningful partitioning of the whole protein sequence space by a fully automatic procedure. A refined two-step algorithm assigns each protein to a family and a superfamily. The sequence data underlying SYSTERS release 4 now comprise several protein sequence databases derived from completely sequenced genomes (ENSEMBL, TAIR, SGD and GeneDB), in addition to t...

متن کامل

SYSTERS, GeneNest, SpliceNest: exploring sequence space from genome to protein

We have integrated the protein families from SYSTERS and the expressed sequence tag (EST) clusters from our database GeneNest with SpliceNest, a new database mapping EST contigs into genomic DNA. The SYSTERS protein sequence cluster set provides an automatically generated classification of all sequences of the SWISS-PROT, TrEMBL and PIR databases into disjoint protein family and superfamily clu...

متن کامل

WWW access to the SYSTERS protein sequence cluster set

SUMMARY We present a Web server where the SYSTERS cluster set of the non-redundant protein database consisting of sequences from SWISS-PROT and PIR is being made available for querying and browsing. The cluster set can be searched with a new sequence using the SSMAL search tool. Additionally, a multiple alignment is generated for each cluster and annotated with domain information from the Pfam ...

متن کامل

The SYSTERS protein sequence cluster set

The SYSTERS (short for SYSTEmatic Re-Searching) protein sequence cluster set consists of the classification of all sequences from SWISS-PROT and PIR into disjoint protein family clusters and hierarchically into superfamily and subfamily clusters. The cluster set can be searched with a sequence using the SSMAL search tool or a traditional database search tool like BLAST or FASTA. Additionally a ...

متن کامل

The SYSTERS protein family database: Taxon-related protein family size distributions and singleton frequencies

Based on the SYSTERS protein family database, we present taxon-related protein family frequencies and distributions. A set of taxon-related protein families is a subset of the whole family set with respect to one taxon, where taxon is not restricted to the species level but may be any rank in the taxonomy. We examine eight ranks in the lineages of seven organisms. A strong linear correlation is...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004